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Program History in a Computer Programming Language 

5 TECHNICAL FIELD 

This invention relates generally to computers, and more particularly to languages for 

programming computers. 

COPYRIGHT NOTICE/PERMISSION 

10 A portion of the disclosure of this patent document contains material that is subject to 

copyright protection. The copyright owner has no objection to the facsimile reproduction by 
anyone of the patent document or the patent disclosure as it appears in the Patent and 
Trademark Office patent file or records, but otherwise reserves all copyright rights 
whatsoever. The following notice applies to the software and data as described below and in 

15 the drawings hereto: Copyright © Microsoft Corporation, 2000. All Rights Reserved. 

BACKGROUND OF THE INVENTION 

A computer operates under the control of programs consisting of coded instructions 
called "object code" that execute on the computer. But, object code is not readily understood 

20 by humans, so a human programmer typically writes programs in a high-level programming 
language, such as BASIC, JAVA, PERL, PASCAL, C, C++, or the like, which are easier to 
understand. High-level languages generally have a precise syntax that defines certain 
permitted structures for statements in the language and their meaning. A translator such as a 
compiler translates the high-level language statements, called "source code" into object code. 

25 Thus, the terms "source code" and "object code" describe the form of a program prior to and 
after translation, respectively. Accordingly, the term "source code" generally refers to a 
program in its high-level programming language form. "Object code," on the other hand, 
generally refers to the program in the form of the coded instructions generated by the 
translator that are executable on a computer. 

30 
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Typically, this translator is itself a program, which might be a compiler or an 
interpreter. A compiler is a program that translates all the source code into target code, which 
can be object code or byte code, prior to execution on the computer. An interpreter is a 
program that executes the source code by translating source statements into an "intermediate 
5 form" or data structure that is subsequently executed. The intermediate form is neither source 
code nor object code, but rather a structure that the interpreter can execute indirectly. The 
interpreter's translation into this intermediate form may be done at any granularity, from one 
statement at a time as needed to execute, all the way to a whole program at a time. 

10 Writing computer programs, even in high-level source code, is a time-consuming and 

laborious process for the computer programmer. For many programs and programmers, the 
time to write a program is the most critical consideration. But, prior programming languages 
have significantly under-emphasized programmer convenience in language design. For 
example, many programming languages require the programmer to write code that merely 

15 performs bookkeeping functions, such as counting the number of times that some action has 
been taken. This bookkeeping code is tedious to write and prone to errors. Further, this 
bookkeeping code distracts the programmer from the main tasks of the program and lowers 
the programmer's productivity. 

20 Thus, there is a need for programming language features that make writing programs 

easier and more convenient, in order to boost programmer productivity. 

SUMMARY OF THE INVENTION 

The above-mentioned shortcomings, disadvantages and problems are addressed by the 
25 present invention, which will be understood by reading and studying the following 

specification. The present invention encompasses a programming-language construct called a 
program history, and a method, system, apparatus, and data structure for translating computer 
source code that contains the program-history construct. A program history captures data 
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regarding the state of the program as it executes and makes this data available to functions 
called history operators, which operate on the data. 

The source code contains a history operand, instances of the history operand, and the 
5 history operator. In one aspect, the presence of the history operand directs a translator to 
generate object code that will save history data regarding the sequence of data that is 
associated with the operand instances as the program executes. The presence of the history 
operator directs the translator to generate object code that will perform a function on that 
saved history data. 

10 

In another aspect, the presence of the history operand directs a translator to save 
history data regarding the sequence of data that is associated with the operand instances as the 
program executes. The presence of the history operator directs the translator to perform a 
function on that saved history data. 

15 

In this way, the programmer is freed from writing tedious and error-prone 
bookkeeping code that requires variables and saves or computes history data into those 
variables, 

20 BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a block diagram of the hardware and operating environment in conjunction 
with which embodiments of the invention may be practiced. 

Fig. 2 is a block diagram illustrating a system-level overview of an exemplary 
25 embodiment of the invention. 

Fig. 3 A is a block diagram of a primary data structure of an exemplary embodiment of 
the invention. 
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Fig. 3B is a block diagram of a primary data structure of an exemplary embodiment of 
the invention. 



Fig. 4 is a block diagram that illustrates an example of using a program history to 
5 average the values in a list. 

Fig. 5 is a block diagram that illustrates an example of using a program history to 
count iterations in a programming loop. 

10 Fig. 6 is a block diagram that illustrates an example of using a program history to 

compute the maximum value of an element of an array. 

„;! Fig. 7 is a block diagram that illustrates an example of using a program history to print 

Ul out a list of values read from input. 

m 15 

!t -* Fig. 8 is a block diagram that illustrates various examples of using a program history. 

!'i i Fig. 9 is a block diagram that illustrates an example of using a program history to 

count the number of times that program execution has visited a particular location identified 

y 

□ 20 by a user-defined label. 

Fig. 10 is a block diagram that illustrates an example of using a program history to 
count the number of times that program execution has visited a particular location using a 
user-defined label. 

25 

Fig. 1 1 is a block diagram that illustrates an example of using a program history to 
limit loop iterations to avoid diverging computations. 
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Fig. 12 is a block diagram that illustrates an example of using a program history 
restricted to a specified call site. 



Fig. 13 is a flowchart illustrating processing flow of an exemplary embodiment. 

5 

DETAILED DESCRIPTION OF THE INVENTION 

In the following detailed description of exemplary embodiments of the invention, 
reference is made to the accompanying drawings that form a part hereof, and in which is 
shown by way of illustration specific exemplary embodiments in which the invention may be 

10 practiced. These embodiments are described in sufficient detail to enable those skilled in the 
art to practice the invention, and it is to be understood that other embodiments may be utilized 
and that logical, mechanical, electrical and other changes may be made without departing 
from the spirit or scope of the present invention. The following detailed description is, 
therefore, not to be taken in a limiting sense, and the scope of the present invention is defined 

1 5 only by the appended claims. 

Introduction 

The invention includes a new programming-language feature called a program history, 
which makes writing programs more convenient by eliminating tedious bookkeeping code in 
20 programs. A program history captures aspects of past program states implicitly, thereby 
freeing the programmer from many error-prone and program-cluttering bookkeeping chores. 
Many different kinds of program histories can be implemented with low overhead, as 
illustrated below. 



25 Hardware and Operating Environment 

Fig. 1 is a diagram of the hardware and operating environment in conjunction with 

which embodiments of the invention may be practiced. The description of Fig. 1 is intended 

to provide a brief, general description of suitable computer hardware and a suitable computing 

environment in conjunction with which the invention may be implemented. Although not 

5 
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required, the invention is described in the general context of computer-executable 
instructions, such as program modules, being executed by a computer, such as a personal 
computer. Generally, program modules include routines, programs, objects, components, data 
structures, etc. that perform particular tasks or implement particular abstract data types. 

5 

Moreover, those skilled in the art will appreciate that the invention may be practiced 
with other computer system configurations, including hand-held devices, multiprocessor 
systems, microprocessor-based or programmable consumer electronics, network PCs, 
minicomputers, mainframe computers, and the like. The invention may also be practiced in 
10 distributed computing environments where tasks are performed by remote processing devices 
that are linked through a communications network. In a distributed computing environment, 
program modules may be located in both local and remote memory storage devices. 

The exemplary hardware and operating environment of Fig. 1 for implementing the 
15 invention includes a general purpose computing device in the form of a computer 20, 

including a processing unit 21, a system memory 22, and a system bus 23 that operatively 
couples various system components include the system memory to the processing unit 21. 
There may be only one or there may be more than one processing unit 21, such that the 
processor of computer 20 comprises a single central-processing unit (CPU), or a plurality of 
20 processing units, commonly referred to as a parallel-processing environment. The computer 
20 may be a conventional computer, a distributed computer, or any other type of computer; 
the invention is not so limited. 



The system bus 23 may be any of several types of bus structures including a memory 

25 bus or memory controller, a peripheral bus, and a local bus using any of a variety of bus 

architectures. The system memory may also be referred to as simply the memory, and 

includes read only memory (ROM) 24 and random access memory (RAM) 25. A basic 

input/output system (BIOS) 26, containing the basic routines that help to transfer information 

between elements within the computer 20, such as during start-up, is stored in ROM 24. The 

6 
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computer 20 further includes a hard disk drive 27 for reading from and writing to a hard disk, 
not shown, a magnetic disk drive 28 for reading from or writing to a removable magnetic disk 
29, and an optical disk drive 30 for reading from or writing to a removable optical disk 31 
such as a CD ROM or other optical media. 

The hard disk drive 27, magnetic disk drive 28, and optical disk drive 30 are 
connected to the system bus 23 by a hard disk drive interface 32, a magnetic disk drive 
interface 33, and an optical disk drive interface 34, respectively. The drives and their 
associated computer-readable media provide nonvolatile storage of computer-readable 
instructions, data structures, program modules and other data for the computer 20. It should 
be appreciated by those skilled in the art that any type of computer-readable media which can 
store data that is accessible by a computer, such as magnetic cassettes, flash memory cards, 
digital video disks, Bernoulli cartridges, random access memories (RAMs), read only 
memories (ROMs), and the like, may be used in the exemplary operating environment. 

As will be described in detail below, aspects of an embodiment pertain to specific 
method steps implementable on computers. In another embodiment, the invention can be 
implemented as a computer program product for use with a computer system. The programs 
defining the functions of the embodiment can be delivered to a computer via a variety of 
signal-bearing media, which include, but are not limited to: 

(1) information permanently stored on non-writeable computer-readable media; 

(2) alterable information stored on writeable computer-readable media; or 

(3) information conveyed to a computer by a communications media, such as through 
a computer or telephone network including wireless communications. 

Such signal-bearing media, when carrying computer-readable instructions that direct 
the functions of the present invention, represent embodiments of the present invention. 

A number of program modules may be stored on the hard disk, magnetic disk 29, 

optical disk 31, ROM 24, or RAM 25, including an operating system 35, one or more 

7 
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application programs 36, other program modules 37, and program data 38. A user may enter 
commands and information into the personal computer 20 through input devices such as a 
keyboard 40 and pointing device 42. Other input devices (not shown) may include a 
microphone, joystick, game pad, satellite dish, scanner, or the like. These and other input 
5 devices are often connected to the processing unit 21 through a serial port interface 46 that is 
coupled to the system bus, but may be connected by other interfaces, such as a parallel port, 
game port, or a universal serial bus (USB). A monitor 47 or other type of display device is 
also connected to the system bus 23 via an interface, such as a video adapter 48. In addition 
to the monitor, computers typically include other peripheral output devices (not shown), such 
10 as speakers and printers. 

The computer 20 may operate in a networked environment using logical connections 
to one or more remote computers, such as remote computer 49. These logical connections are 
achieved by a communication device coupled to or a part of the computer 20; the invention is 

15 not limited to a particular type of communications device. The remote computer 49 may be 
another computer, a server, a router, a network PC, a client, a peer device or other common 
network node, and typically includes many or all of the elements described above relative to 
the computer 20, although only a memory storage device 50 has been illustrated in Fig. 1 . 
The logical connections depicted in Fig. 1 include a local-area network (LAN) 5 1 and a wide- 

20 area network (WAN) 52. Such networking environments are commonplace in offices, 
enterprise-wide computer networks, Intranets, and the Internet. 

When used in a LAN-networking environment, the computer 20 is connected to the 
local network 5 1 through a network interface or adapter 53, which is one type of 
25 communications device. When used in a WAN-networking environment, the computer 20 
typically includes a modem 54, a type of communications device, or any other type of 
communications device for establishing communications over the wide area network 52, such 
as the Internet. The modem 54, which may be internal or external, is connected to the system 
bus 23 via the serial port interface 46. In a networked environment, program modules 
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depicted relative to the personal computer 20, or portions thereof, may be stored in the remote 
memory storage device. It is appreciated that the network connections shown are exemplary 
and other means of and communications devices for establishing a communications link 
between the computers may be used. 

5 

The hardware and operating environment in conjunction with which embodiments of 
the invention may be practiced has been described. The computer in conjunction with which 
embodiments of the invention may be practiced may be a conventional computer, a 
distributed computer, or any other type of computer; the invention is not so limited. Such a 
10 computer typically includes one or more processing units as its processor, and a computer- 
readable medium such as a memory. The computer may also include a communications 
device such as a network adapter or a modem, so that it is able to communicatively couple 
other computers. 

15 System-Level Overview 

Fig. 2 is a block diagram illustrating a system-level overview of an exemplary 
embodiment of the invention. Fig. 2 illustrates source code 205, translator 210, object code 
215, and history data 255, all of which can be contained in system memory 22, as previously 
described in Fig. 1. In one embodiment, translator 210 translates source code 205 into object 

20 code 215 as further described below. Object code 215 creates history data 255 as further 
described below. In another embodiment, translator 210 interprets source code 205 to create 
history data 255, as further described below. 

Source code 205 contains programming language statements that are generally 
25 understandable by a human. Source code 205 contains operand instances 220-1, 220-2, and 
220-3, history operator 225, and history operand 230, which are all programming language 
statements or portions thereof. 
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Operand instances 220-1, 220-2, and 220-3 are programming language symbols 
representing variables, expressions, labels, functions, or memory locations. An operand 
instance can also be a heap-allocated object. Although this example shows three operand 
instances, any number can be present. 

History operand 230 represents a sequence of data associated with the history of 
operand instances 220-1, 220-2, and 220-3, The sequence of data is stored in history data 
255, as described below. Although one history operand is shown in Fig. 2, any number can be 
present. 

History operator 225 represents a function that object code 215 will perform on the 
history data 255, which is represented by history operand 230. Many examples of history 
operator functions are given in Figs. 4-12 and can include any function, such as sum, length, 
average, min, max, mean, trimmed mean, variance, mode, first, last, subsequence, any 
reduction function, and any statistical function. In addition, since history data represents a 
collection, a history operator can include functions for querying and filtering the collection. 
For example, a history operator can include a function that queries whether the value of a 
variable was ever less than zero. Since history data 255 can include information about where 
the definition occurred, history operator 225 can include a query for those assignments that 
occurred at a particular source coordinate (e.g., a query such as "Did this particular 
assignment ever result in x being less than zero?"). Although only one history operator is 
shown in Fig. 2, any number can be present. 

Examples of operand instances, history operators, and history operands are shown 
below in Figs. 4-12. The invention is not limited to the examples shown, and any type of 
operand instance, history operator, and history operand can be used. 

Translator 210 contains history-processing program 250. Translator 210 can be a 

compiler, interpreter, assembler, or a pre-processor for a compiler, interpreter, or assembler. 

10 
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A compiler initially performs lexical analysis on the source code to separate the source 
code into various lexical structures of the programming language (generally known as 
tokens), such as keywords, identifiers, operator symbols, punctuation, and the like. Then, 
5 through syntax analysis, the compiler groups the tokens into various syntax structures of the 
programming language, such as expressions, declaration statements, loop statements, 
procedure calls, and the like. Finally, the compiler generates and (optionally) optimizes 
executable object code for each of these structures. 

10 An interpreter is a program that executes the source code indirectly. The source code 

is translated into an intermediate format known to the interpreter that the interpreter then 
executes. The translation may occur all at once, or piece-by-piece. It is possible that piece- 
by-piece translation may translate the same piece of source code many times or not at all. 

15 An assembler is a program that converts assembly language programs, which are 

understandable by humans, into executable processor instructions. Assembly language is a 
low-level programming language that uses abbreviations or mnemonic codes in which each 
statement typically corresponds to a single processor instruction or a sequence of processor 
instructions. An assembly language and its assembler are typically specific to a given 

20 processor. 

In one embodiment, history-processing program 250 detects history operand 230, 
finds operand instances 220-1, 220-2, and 220-3, and generates object code 215. Object code 
215 contains coded instructions generated by translator 210 that are executable by processing 
25 unit 21, as previously described above. When executed, object code 215 creates history data 
255 and executes history operator 225 on history data 255, as further described below in Figs. 
4-13. The format for history data 255 is described below in Figs. 3A and 3B. 
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In another embodiment, history-processing program 250 detects history operand 230, 
finds operand instances 220-1, 220-2, and 220-3, creates history data 255, and executes 
history operator 225 on history data 255, as further described below in Figs. 4-13. 

In one embodiment, history data 255 contains a sequence of data that saves the history 
of the operand instances as object code 215 executes. History data 255 can contain such data 
as the history of values assigned to variables, the history of values read from variables, the 
history of program execution through various user-defined or language-defined labels, or the 
history of values returned from functions, as further described below. In another 
embodiment, history data 255 represents one or more storage locations or registers used as 
accumulators to process reduction operations (such as sum, count, maximum, minimum, and 
length); the accumulators are then updated at all assignments to a variable. 

Fig. 3A is a block diagram of history data array 305, which is an example format for 
history data 255, as previously described in Fig. 2. Using the terminology of Fig. 2 to explain 
the context and format of history data 305, source code 205 contained history operand "<x>" 
307, which caused history-processing program 250 to either create history data array 305 or to 
generate object code that, when executed, created history data array 305. In this example 
"o" is used surrounding a variable name "x" in order to denote a history operand. But, any 
nomenclature can be used, such as "{}," "[]," "() " "## " "!!," or any other suitable keyword, 
including a keyword not consisting of a matched pair of symbols, as defined by the 
programming language in which source code 205 is written. 

In one embodiment, history-processing program 250 then created object code, which, 

when executed, stored all the values assigned to the operand instance (the variable "x") in 

sequence in history data 305. History data 305 is a one-dimensional array containing the 

sequence of values that were assigned to variable x since the start of execution of object code 

215 until the occurrence of history operator 225 and history operand 230. Thus, <x>(i) 

represents the i th element of <x> (i.e., the i th value assigned to x). Thus, <x>(l) contains 

12 
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valuej 310-1, which is the first value that was assigned to variable x as the program executed. 
Analogously, <x>(2) contains value 2 310-2, which is the second value that was assigned to 
variable x as the program executed, and <x>(n) contains value^ which is the n th value that 
was assigned to variable x as the program executed. Although three values in history data 305 
are illustrated, any number can be present. Although the array in history data 305 begins at 
index 1, any base can be used; for instance, in the C programming language, it would be 
logical for the first element to be at index 0, <x>(0). 

Although not shown in Fig. 3A, in another embodiment history data 305 contains a 
length field or a sentinel at the end of the array, in order to facilitate the implementation of the 
length history operator. 

Although the embodiment shown in Fig. 3 A implements the history data as an array of 
contiguous words in memory, in another embodiment a linked-list is used. In still another 
embodiment, a file is used. 

In yet another alternative embodiment to the array implementation described above, 
one or more accumulators are used for those history operators that fall into the category of 
reduction operations, such as sum, count, max, and min. Implementing reduction operations 
then only requires updating the accumulator at all assignments to the given variable. The 
overhead for using an accumulator is no greater than the overhead of the explicit computation, 
and an accumulator would save memory, which is especially important for those operands 
that have large amounts of history data. 

Although the example of Fig 3 A focuses on values assigned to a variable, history data 

is not so restricted. The history data can also include the sequence of values read from a 

variable, which the programming language can indicate by another keyword defined by the 

programming, for example u [x]." Also, while the example of Fig. 3A focuses on the variable 

"x," any variable can be used. In addition, the history data can contain values other than those 

13 
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associated with variables, such as functions where, for example, "<f>" denotes the sequence 
of values that the function **f ' has returned. Although "f is used here, any function can be 
used. 

5 Fig. 3B is a block diagram of a history data array 350, which further generalizes the 

format of history data 255 shown in Fig. 3A. Using again the terminology of Fig. 2 to explain 
the context and format of history data 350 in Fig. 3B, source code 205 contained history 
operand "<y>", which caused history-processing program 250 to create history data array 
350. In one embodiment, history-processing program 250 then created object code 215, 
10 which, when executed, stored data associated with the variable "y" in sequence in history data 
305. In another embodiment, history-processing program 250 then stored data associated 
with the variable **y" in sequence in history data 305. 

Thus, <y>(l) 355 contains valuei, location^ and timestampi. Valuej is the first value 
15 that was assigned to variable y as the program executed. Location] is the location in source 
code 205 where the first assignment occurred. Timestampi is the time at which the first 
assignment occurred. The location information can be used by query operations to answer 
such questions as "What value was assigned to y the last time it was updated at this source 
coordinate?" or "How many times was variable y updated at source coordinate z?". The 
20 timestamp information - measured, for example, in CPU cycles, elapsed time, or absolute 
time ~ provides valuable profiling information. 

Likewise, <y>(2) 360 contains value 2 , location^ and timestamp 2 . Value 2 is the second 

value that was assigned to variable y as the program executed. Location 2 is the location in 

25 source code 205 where the second assignment occurred. Timestamp 2 is the time at which the 

second assignment occurred. Finally, <y>(3) 365 contains value 3 , location 3 , and timestamp 3 . 

Value 3 is the third value that was assigned to variable y as the program executed. Location 3 is 

the location in source code 205 where the third assignment occurred. Timestamp 3 is the time 

at which the third assignment occurred. Thus, history data 350 is a two-dimensional array 

14 
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containing a sequence of values that were assigned to the variable "y" during execution of 
object code 215 prior to the occurrence of history operator 225 and history operand 230. In 
addition, history data 350 also contains data associated with the assignments. Although three 
values in history data 350 are illustrated, any number can be present. 

5 

Although three columns are shown in Fig. 3B - value, location, and timestamp -- any 
number can be present. For example, in order to support function call histories and tracing, 
history data can include such columns as function arguments and return values, the call site, 
and the time of the call. 

10 

Although the example of Fig 3B focuses on values assigned to a variable, history data 
is not so restricted. The history data can also include the sequence of values read from a 
variable, which the programming language can indicate by another keyword defined by the 
programming, for example "[y] " Also, while the example of Fig. 3B focuses on the variable 
15 "y," any variable can be used. In addition, the history data can contain values other than those 
associated with variables, such as functions where, for example, "<f>" denotes the sequence 
of values that the function "f" has returned. Although "f" is used here, any function can be 
used. 

20 Fig. 4 is a block diagram that illustrates an example of using a program history to 

average the values in a list. Code segment 400 is contained within source code 205. Code 
segment 400 contains code fragments 405 and 450. Code fragment 405 contains a series of 
code statements representing printing out an average of values in a list using conventional 
programming techniques. Code fragment 450 contains a series of code statements 

25 representing printing out an average of values in a list using a program history function. Code 
fragment 450 contains average function 455, which is the history operator. The history 
operand, "<x>" 460 represents the history data, which is the sequence of values assigned to 
the variable "x." The operand instance is "x" 465. 
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As can be seen this example, by eliminating explicit bookkeeping code - such as 
initializing and updating accumulator values shown in code fragment 405- histories make 
programs shorter and simpler. 

5 Fig. 5 is a block diagram that illustrates an example of using a program history to 

count iterations in a programming loop. Code segment 500 is contained within source code 
205. Code segment 500 contains code fragments 505 and 550. Code fragment 505 contains a 
series of code statements representing printing a comma-separated list of values by explicitly 
maintaining a flag or counter to distinguish the first loop iteration from the rest. Code 
10 fragment 550 contains a series of code statements that accomplishes the same function as 
^ code fragment 505 but using a count history operator to count loop iterations. Code fragment 

550 contains count function 555, which is the history operator. The history operand, 
j-ij "<while>" 560 represents the history data, which is the iteration count of the while loop, or 

4] more precisely, the number of times the innermost enclosing while loop has begun executing 

'<!! 

15 its body. The operand instance is the "while" loop keyword 565. 

i H * Fig. 6 is a block diagram that illustrates an example of using a program history to 

compute the maximum value of an element of an array. Code segment 600 is contained 
jllj within source code 205. Code segment 600 contains code fragments 605 and 650. Code 

□ 20 fragment 605 contains a series of code statements representing using an explicitly declared 
variable "max" 607 to find the maximum value of an element of an array. In contrast, code 
fragment 650 contains a series of code statements that accomplishes the same function as 
code fragment 650 but using a max history operator to find the maximum element. Code 
fragment 650 contains max function 655, which is the history operator. The operand instance 
25 is "x" 665. The history operand, "<x>" 660 represents the history data, which is the sequence 
of values that were assigned to the variable "x" between the time that the program began 
executing and the time that history operator 655 was executed. Thus code fragment 650 
avoids the bookkeeping code associated with max variable 607. 
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In one embodiment, the max history operator is computed by having the history 
operand maintain an array of the history data associated with history operand <x> from which 
the maximum can be computed. In another embodiment, the computation is done directly by 
maintaining a single accumulator for the maximum that is updated at every assignment to 
5 operand instance x. This is true for functions that can be computed from accumulator values. 
For example, the "average" history operator can be computed efficiently from two 
accumulators, one for the sum and one for the count. 

Fig. 7 is a block diagram that illustrates an example of using a program history to print 
10 out a list of values read from input. Code segment 700 is contained within source code 205. 
Code segment 700 contains code fragments 705 and 750. Code fragment 705 contains a 
series of code statements representing using an explicitly declared variable "list" 706 to 
contain the input values. In contrast, code fragment 750 contains a series of code statements 
that accomplishes the same function as code fragment 705 but using a length history operator 
15 to find the length of history operand, <x>. Thus, code fragment 750 frees the programmer 
from declaring "list" variable 706. Code fragment 750 contains length function 755-1 and 
index ("[]") function 755-2, which are the history operators. The operand instance is "x" 765. 
The history operands "<x>" 760-1 and 760-2 represent the history data, which is the 
sequence of values that were assigned to the variable "x" between the time that the program 
20 began executing and the time that their respective history operator (755-1, 755-2) is executed. 

Fig. 8 illustrates some uses of histories that do not easily lend themselves to side-by- 
side comparisons of code. Code segment 800 is contained within source code 205. Code 
segment 800 contains code fragments 805, 810, 815, 820, and 825. 

25 

Code fragment 805 contains a series of code statements that determine whether a 

variable has ever been assigned a value, i.e., whether the variable is initialized. Note that 

code to perform this function without using histories would require maintaining a flag at 

possibly many assignment locations — without careful program analysis, all assignment 
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locations would require maintaining the flag. Code fragment 805 contains length function 
806, which is the history operator. Code fragment 805 further contains "<x>" 808, which is 
the history operand. 

5 Code fragment 810 contains a series of code statements that prints the number of 

warning messages issued by reporting the number of calls to the function "warning." Note 
that to maintain this information explicitly would require tedious bookkeeping code that 
would maintain an additional variable. That variable would have to be incremented either at 
all call sites, or within the routine itself. Code fragment 810 contains length function 811, 

10 which is the history operator. Code fragment 810 further contains "<warning>" 813, which is 
the history operand. Thus, the history data is the history of all values returned by function 
calls to the ''warning" function. 

Code fragment 815 contains a series of code statements that counts the number of 
15 input records read using the "gets" function while processing a file. Code fragment 815 
contains length function 816, which is the history operator. Code fragment 815 further 
contains "<gets>" 818, which is the history operand. Thus, the history data is the history of 
the values returned by the "gets" function. 

20 Code fragment 820 contains a series of code statements that resets the history data to a 

null sequence. Programmers can use the reset function when they are interested in less than a 
complete history, such as the values assigned to a variable inside a specific loop. Code 
fragment 820 contains reset function 821, which is the history operator. Code fragment 820 
further contains "<x>" 823, which is the history operand. The reset operator is different from 

25 other history operators in that it changes the history operand, rather than simply computing 
some function based on the history operand. 

Code fragment 825 contains a series of code statements that illustrate a history 

operand that is an expression. An expression is a useful history operand because 
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programmers may not only be interested in the value of a specific variable, but also in the 
history of how two or more variables are related. The semantics of such a history may be that 
a new sequence value is added to the history data whenever the value of either x or y changes. 
Other semantics associated with histories of expressions are also possible. Code fragment 
5 825 contains minimum function 826, which is the history operator. Code fragment 825 
further contains "<x-y>" 828, which is the history operand that is an expression. Although 
the expression "x-y" is shown in this example, any expression can be used. 

Fig. 9 is a block diagram that illustrates an example of using a program history to 
10 count the number of times that program execution has visited a particular location. Code 
segment 900 is contained within source code 205. Code segment 900 contains code 
fragments 905 and 950. Code fragment 905 contains a series of code statements representing 
using explicitly declared variable "counter" 906 to count the number of times a code segment 
has been executed without using histories. In contrast, code fragment 950 contains a series of 
15 code statements that accomplishes the same function as code fragment 905 but using a count 
history operator to count the number of times program execution has visited a code statement 
identified by a user-defined label. When programmers want to refer to the history of a control 
construct, they label the construct. Thus, code fragment 750 frees the programmer from 
declaring a "counter" variable and explicitly incrementing it. 

20 

Code fragment 950 contains count function 955, which is the history operator. The 
history operand is "<label>" 960. Label 965 is the operand instance. The history operand 
"<label>" 960 represents the history data, which is the number of times that the code 
associated with label 965 was executed. 

25 

Fig. 10 is a block diagram that illustrates an example of using a program history to 
print out the number of times that program execution has visited the "then" and "else" 
branches of a compound, conditional statement. Thus, the example of Fig. 10 eliminates the 
need for the user-defined labels of Fig. 9. 
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Fig. 10 illustrates code segment 1000, which is contained within source code 205. 
Code segment 1000 contains code fragments 1005 and 1050. Code fragment 1005 contains a 
series of code statements representing using explicitly declared variables "thenCount" 1006 
and "elseCount" 1007 to count the number of times the "then" and "else" legs of the 
conditional statement have been executed without using histories. In contrast, code fragment 
1050 contains a series of code statements that accomplishes the same function as code 
fragment 1005 but using a count history operator. 

Code fragment 1050 contains count functions 1055-1 and 1055-2, which are the 
history operators. The count functions count the number of times program execution has 
visited the "then" and "else" legs of the conditional "if statement. Language-defined labels 
"<posTest.then>" 1060-1 and "<posTest.else>" 1060-2 are the history operands, each with its 
own history. These language-defined labels allow the programmer to refer to the "then" and 
"else components of the compound "if statement, which begins at label "posTesf . The legs 
1065-1 and 1065-2 of the conditional "if statement are the operand instances. The history 
operands represent the respective history data, which contains the number of times the 
respective program leg was executed. Thus, program fragment 1050 avoids the bookkeeping 
code of explicitly declaring, initializing, and incrementing variables thenCount 1006 and 
elseCount 1007. 

Fig. 1 1 is a block diagram that illustrates an example of using a program history to 
limit loop iterations to avoid diverging computations. Fig. 1 1 illustrates code segment 1 100, 
which is contained within source code 205. Code segment 1 100 contains code fragments 
1 105 and 1 150. Code fragment 1 105 contains a series of code statements that use explicitly 
declared variable "limit" 1 106 to limit loop iterations to avoid diverging computations. In 
contrast, code fragment 1 150 contains a series of code statements that accomplishes the same 
function as code fragment 1 105 but using a count history operator associated with the "while" 
loop. 

20 



MS docket 150948.1; SLWK 777.400usl 



Code fragment 1 150 contains count function 1 155, which is the history operator. The 
count function counts the number of times program execution has begun "while" loop 1 165. 
The operand instance is "while" loop 1 165. The history operand "<while>" 1 160 represents 
5 the history data, which contains the number of times the while loop was executed. 

As can be seen in these examples, program histories provide an effective tool for 
program introspection, profiling, and debugging. With histories, it is very easy for a 
programmer to write a function profiler that reports the number of calls to every function in 
10 the program. Likewise, it is easy to determine how many objects were allocated (e.g., by 
counting calls to malloc), or to determine whether the number of allocations equals the 
number of deallocations. By making these data easier to accumulate, histories enable 
programmers to exploit this information. 

15 Fig. 12 is a block diagram that illustrates an example of using a program history 

restricted to a specified call site. Fig. 12 illustrates code segment 1200, which is contained 
within source code 205. Code segment 1200 contains code statements that search a list for a 
value and report how many comparisons were required before the value was found. 

20 History operand <match:equal> 1260 is the sequence of values returned by the 

"equal" function at location "match." This history limits the history to a call-site, rather than 
the function's global history. The history operator is length function 1255, which returns the 
number of entries in the history data associated with history operand 1260. 

25 Methods of an Exemplary Embodiment of the Invention 

In the previous section, a system level overview of the operation of an exemplary 

embodiment of the invention was described. In this section, the particular methods performed 

by a computer of such an exemplary embodiment are described by reference to a flowchart. 

The methods to be performed constitute computer programs made up of computer-executable 
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instructions. Describing the methods by reference to a flowchart enables one skilled in the art 
to develop such programs including such instructions to carry out the methods on a suitable 
computer where the processor of the computer executes the instructions from computer- 
readable media. 

5 

Fig. 13 is a flowchart illustrating the processing flow of history-processing program 
250 in an exemplary embodiment. Control begins at block 1300. Control then continues to 
block 1305 where history-processing program 250 recognizes a history operand in the source 
code being translated. Control then continues to block 1310 where history-processing 
10 program 250 allocates storage (such as arrays 305 or 350, a list, or a register or storage 
location to be used as an accumulator) to hold the history data associated with the history 
operand. In a two-pass compiler, blocks 1305 and 1310 would typically be executed during 
the first pass while the following blocks would typically be executed during the second pass. 

15 In another embodiment, history-processing program 250 automatically keeps the 

histories for all variables, labels, memory locations, and functions, and objects used by the 
code in order to obviate the need for recognizing history operands and allocating storage for 
only for those recognized. This embodiment would make the history function better suited for 
one-pass compilers, interpreters, and assemblers. 

20 

Control then continues to block 1315 where history-processing program 250 finds all 
instances of the operand within the source code. Control then continues to block 1320 where 
history-processing program 250 generates object code within object code 215 that, when 
executed, will save history data at each instance of the operand. The object code generated at 
25 block 1320 accounts for scoping issues associated with global versus local variables. The 
generated object code also handles dynamic references to memory that cannot be determined 
at compile-time, such as pointer references and dynamic indexing into arrays. In another 
embodiment, history-processing program 250 does not create object code, but instead saves 
history data at each instance of the operand. 
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In the embodiment that uses an accumulator to implement reduction operators such as 
sum, count, max, and min, it is not necessary to save history data at each instance of the 
operand. Instead, implementing reduction operations only requires updating the accumulator 
at all assignments to the given variable. 

Control then continues to block 1325 where history-processing program 250 
recognizes the history operator associated with the history operand. Control then continues to 
block 1330 where history-processing program 250 generates object code that, when executed, 
will perform the history operator on the history data. In another embodiment, history- 
processing program 250 does not generate object code, but instead performs the history 
operator on the history data. 

Control then continues to block 1399 where the function returns. 

Conclusion 

Histories provide programmers with convenient access to past program state. To 
maintain the same information explicitly » without histories « requires substantial effort. 
When programmers must maintain this historical program state (as with the examples above), 
the necessary bookkeeping code is a burden. Program histories liberate programmers from 
these chores. 

Program histories provide an effective tool for program introspection, profiling, and 
debugging. With histories, it is very easy for a programmer to write a function profiler that 
reports the number of calls to every function in the program. Likewise, it is easy to determine 
how many objects were allocated or to determine whether the number of allocations equals 
the number of deallocations. By making these data easier to accumulate, histories enable 
programmers to exploit this information. 
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By eliminating bookkeeping and unnecessary code, program histories improve 
programmer convenience and increase programmer productivity. 
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